keywords:"lexicon" - Search Results - Digital Repository


	Sentiment Analysis in Automotive Industry
		Bezák, Adam ; Otrusina, Lubomír (referee) ; Smrž, Pavel (advisor)
		This thesis introduces the basic methods of sentiment analysis on social networks. It focuses on the automotive industry, although the same principle can be applied to any other domain. The practical part obtains data from social networks, analyzes them, and indexes them into an Elasticsearch database. Another goal of the thesis is to visualize these data in a web portal. The portal provides various statistics on the leading automobile brands, an overview of new trends, and aspect-level visualization for individual cars.
	Sociolinguistic analysis of communication of men and women
		Servusová, Carolina ; Jančík, Jiří (advisor) ; Suková Vychopňová, Kateřina (referee)
		The bachelor's thesis deals with interpersonal communication. It focuses on differences in communication and expression between men and women, and summarizes the most important findings on the topic from the literature and scientific articles. It also places the work in the context of sociolinguistic fields that deal with society, language, and communication processes. A secondary objective is to determine whether expressive differences are innate or learned. The thesis is divided into seven chapters. The first chapter deals with language and communication and outlines the first differences in communication. The next three chapters focus on the scientific disciplines that deal with the differentiation of society as well as with language and communication: sociolinguistics, psycholinguistics, and anthropolinguistics. The following chapters deal with language production and the gender heterogeneity of speech, looking for the differences and influences that affect speech and communication. The last chapter presents qualitative research, an observation of several dozen communication situations, mostly in French settings. We are trying to find the differences that we have searched in the...
	Slang in Football with Focusing on Anglicisms in the Sport Daily
		Pabousek, Jiří ; Vlčková, Jana (advisor) ; Hirschová, Milada (referee)
	Qualitative analysis of professional terminology in specialized fields
		Homolka, Adam ; Zmrzlá, Petra (referee) ; Šťastná, Dagmar (advisor)
		The aim of this bachelor's thesis is to characterize specialized style (features, functions, form, syntax, lexicon) and then to summarize theoretical findings on the translation of specialized style, concerning terminology and translation procedures at the lexical and grammatical levels. The practical part applies these theoretical findings in a qualitative analysis of selected specialized texts.
	Automatic dictionary acquisition from parallel corpora
		Popelka, Jan ; Pecina, Pavel (advisor) ; Mareček, David (referee)
		In this work, an extensible word-alignment framework is implemented from scratch. It is based on a discriminative method that combines a wide range of lexical association measures and other features, and it requires only a small amount of manually word-aligned data to optimize the parameters of the model. The optimal alignment is found as a minimum-weight edge cover, and selected suboptimal alignments are used to estimate the confidence of each alignment link. The feature combination is tuned over many experiments with respect to the evaluation results, which are compared to GIZA++. The best trained model is used to word-align a large Czech-English parallel corpus, and a bilingual lexicon is extracted from the links of highest confidence. Single-word translation equivalents are sorted by their significance, and lexicons of different sizes are extracted by taking the top N translations. The precision of the lexicons is evaluated both automatically and manually, by judging random samples.
	Czech Reality in Chinese Language of the Newsletter of Chinese Living in the Czech Republic (Jiéhuá tōngxùn)
		Mirková, Renata ; Zádrapa, Lukáš (advisor) ; Pavlík, Štěpán (referee)
		This thesis focuses on Czech realia as rendered in Chinese in the magazine Newsletter of the Chinese Living in the Czech Republic (Jiéhuá tōngxùn 捷华通讯). The analysis of the collected material is preceded by chapters on the Chinese lexicon, word-formation, and loan-words. Material referring to Czech realia from selected issues of the magazine is divided into thematic groups, analyzed, and compared with the common Chinese lexical system. The material includes not only loan-words but also common Chinese words that are used to describe Czech realia. The thesis closes by describing some tendencies in the naming of Czech realia. The collected material is included in the appendix of the thesis and could be used as a dictionary.
		Keywords: Czech reality, Czech-Chinese Journal, Chinese, Chinese living in the Czech Republic, lexicon, word, word-formation, loan-words
	Automatic linking of lexicographic sources and corpus data
		Bejček, Eduard ; Lopatková, Markéta (advisor) ; Horák, Aleš (referee) ; Žabokrtský, Zdeněk (referee)
		With the growing development of language resources (new lexicons, lexical databases, corpora, treebanks), the need to interlink them efficiently is also growing. Such linking makes it easy to benefit from all their properties and information. Given the convergence of resources, universal lexicographic formats are frequently discussed. In the present thesis, we investigate and analyse methods of interlinking language resources automatically. We introduce a system for interlinking lexicons (such as VALLEX, PDT-Vallex, FrameNet or SemLex) that offer information on the syntactic properties of their entries. The system is automated and can be used repeatedly with newer versions of lexicons under development. We also design a method for identifying multiword expressions in a parsed text based on syntactic information from the SemLex lexicon. One output that verifies the feasibility of the methods used is the mapping between the VALLEX and PDT-Vallex lexicons, which results in tens of thousands of annotated treebank sentences from the PDT and PCEDT treebanks being added to VALLEX.
	News Feed Classifications to Improve Volatility Predictions
		Pogodina, Ksenia ; Šopov, Boril (advisor) ; Červinka, Michal (referee)
		This thesis analyzes various text classification techniques in order to assess whether knowledge of published news articles about selected companies can improve the modelling and forecasting of their stock return volatility. We examine the content of the textual news releases and derive the news sentiment (polarity and strength) using three different approaches: a supervised machine learning algorithm (Naive Bayes), a lexicon-based method representing the linguistic approach, and a hybrid Naive Bayes. In the hybrid Naive Bayes we consider only the words contained in the specific lexicon rather than the whole set of words from the article. For the lexicon-based approach we independently used two lexicons, one with binary and another with multiclass labels. The training set for the Naive Bayes was labeled by the author. Comparing the classifiers from the machine learning approach, we conclude that all of them performed similarly, with a slight advantage for the hybrid Naive Bayes combined with the multiclass lexicon. The resulting quantitative data, in the form of sentiment scores, are then incorporated into GARCH volatility modelling. The findings suggest that information contained in news feeds does bring additional explanatory power to the traditional GARCH model and is able to improve its forecast. On the...
	Comparison of a traditional dictionary description and a corpus of written Czech with regard to semantic prosody
		Vovchuk, Oleksandr ; Cvrček, Václav (advisor) ; Hudousková, Andrea (referee)
		Czech dictionaries were created in the pre-corpus era, so some of their entries do not take into account semantic prosody: the fact that some lexemes occur in particular contexts (consequences are always far-reaching or catastrophic, while an intention can be either evil or noble). The aim of this thesis is to compare selected parts of a dictionary (a randomly selected probe into Czech adjectives) with corpus material, to define the extent of missing information in the current description of the Czech language, and to explore what percentage of information about entries is missing in contemporary dictionaries. We based our research on Slovník spisovného jazyka českého (Dictionary of Written Czech) and the representative corpora of written language SYN2005 or SYN2010. Context analysis was carried out by means of statistical methods and collocation measures. The difference between dictionary definitions and the information inferred from the corpus research could serve as a further guideline for creating a new dictionary (besides adding a whole range of new entries that are still missing in Czech dictionaries).
		Keywords: lexicon, corpus, semantic prosody
	Valency of Verbs in the Prague Dependency Treebank
		Urešová, Zdeňka ; Hajičová, Eva (advisor) ; Lopatková, Markéta (referee) ; Ondrejovič, Slavo (referee)
		Author: PhDr. Zdeňka Urešová. Department: Institute of Formal and Applied Linguistics, MFF UK. Supervisor: Prof. PhDr. Eva Hajičová, DrSc.
		This dissertation describes PDT-Vallex, a valency lexicon of Czech verbs, and its relation to the annotation of the Prague Dependency Treebank (PDT). The PDT-Vallex lexicon was created during the annotation of the PDT and is a valuable source of verbal valency information, available both for linguistic research and for computerized natural language processing. In this thesis, we describe not only the structure and design of the lexicon (which is closely related to the notion of valency as developed in the Functional Generative Description of language) but also the relation between PDT-Vallex and the PDT. The explicit and full-coverage linking of the lexicon to the treebank prompted us to pay special attention to diatheses; we propose formal transformation rules for diatheses to handle their surface realization even when the canonical forms of verb arguments as captured in the lexicon do not correspond to the forms of these arguments actually appearing in the corpus.
